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Abstract 

We  study  the  issue  of  designing  a  downlink  scheduling  policy  for  a  cellular  network  with  base 
station  antenna  arrays.  We  derive  an  optimal  scheduling  policy  that  achieves  the  throughput  region, 
which  is  a  set  of  feasible  arrival  rate  vectors  that  can  be  stabilized  by  some  scheduling  policy.  Then, 
based  on  the  structure  of  the  derived  optimal  policy  whose  complexity  increases  exponentially  with 
the  number  of  users  in  the  system,  we  propose  two  heuristic  scheduling  algorithms  with  much  lower 
complexity.  We  demonstrate  that  our  proposed  algorithms  perform  much  better  than  other  heuristic 
algorithms  that  do  not  take  into  consideration  the  physical  layer  constraints  and/or  queue  lengths  in  the 
sense  that  they  have  a  larger  throughput  region  than  other  heuristic  algorithms. 

Keywords  -  antenna  array,  cross-layer  design,  optimal  scheduling,  stability 

I.  Introduction 

Wireless  communication  has  been  going  through  a  rapid  transition  from  the  traditional  circuit 
switched  voice  services  to  packet  switched  data  services  during  the  past  decade.  Increasing 
demand  for  fast  wireless  access  and  high-speed  wireless  links  and  the  advent  of  wireless  appli¬ 
cations  such  as  wireless  multimedia  that  demands  certain  quality-of-service  (QoS)  guarantees, 
have  stimulated  much  research  in  wireless  communication. 
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New  network  architectures  and  protocols  are  proposed  to  support  data  applications  in  wireless 
networks.  A  typical  architecture  in  many  of  current  wireless  systems,  especially  cellular  networks 
(< e.g UMTS,  cdma  2000),  provides  wireless  access  to  mobile  users  through  access  points  (APs) 
or  base  stations  (BSs)  that  are  connected  to  the  core  network.  The  most  challenging  task  in 
designing  these  communication  systems  is  to  support  QoS  guarantees  to  various  data  applica¬ 
tions  on  wireless  channels  with  limited  bandwidth  and  time  varying  characteristics.  There  exist 
different  notions  of  QoS  at  different  communication  layers.  For  instance,  the  QoS  at  the  physical 
layer  may  be  given  as  certain  signal  to  interference  and  noise  ratio  (SINR)  or  a  corresponding 
bit  error  rate  (BER)  at  the  receiver.  At  the  MAC  layer  the  QoS  is  typically  expressed  in  terms  of 
achievable  bit  rate  or  packet  error  rate  (PER),  while  at  the  higher  layers  the  QoS  can  be  perceived 
as  a  minimum  throughput  and/or  maximum  delay  requirement.  The  ability  of  a  network  to  satisfy 
such  QoS  requirements  and  enhance  the  system  capacity  depends  on  the  interaction/cooperation 
of  several  layers.  As  a  result,  a  joint  design  of  protocols  for  two  or  more  layers  can  lead  to 
significant  improvement  in  overall  network  performance.  This  paper  provides  an  example  where 
a  joint  design  of  physical  layer  and  MAC  scheduling  algorithms  offers  a  significant  improvement 
in  network  throughput/capacity. 

A  wide  spectrum  of  approaches  have  been  proposed  to  reuse  the  communication  resources 
in  time,  frequency  and/or  space  domain,  to  provide  the  QoS  guarantees  to  mobile  users  and  to 
improve  wireless  network  capacity.  Among  these  approaches,  the  application  of  antenna  arrays, 
which  exploits  the  spatial  diversity  of  mobile  users,  is  considered  a  more  promising  one  and  the 
last  frontier  for  future  capacity  improvement  of  wireless  networks.  This  is  due  to  the  beamforming 
capability  of  the  antenna  arrays  that  can  form  the  beam  pattern  directed  to  a  desired  user  while 
nulling  out  the  others.  This  greatly  reduces  co-channel  interference  at  the  receiver(s),  and  spatially 
separable  users  can  share  the  same  channel  with  their  QoS  requirements  satisfied. 

Previous  research  on  application  of  antenna  arrays  in  cellular  environments  can  be  categorized 
into  two  classes.  The  first  class  of  research  is  on  the  physical  layer;  given  a  set  of  users,  the 
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problem  is  to  design  optimal  algorithms  to  calculate  the  beamforming  weights  for  each  user. 
The  problem  is  often  modeled  as  an  optimization  problem,  where  the  objective  is  to  minimize 
the  total  transmission  power  subject  to  the  minimum  SINR  constraint  for  each  user  [3],  [14]. 

The  second  class  of  research  is  on  the  MAC  layer  with  consideration  of  physical  layer  user 
separability  constraints.  The  goal  of  this  approach  is,  given  a  set  of  users,  to  place  as  many  users 
on  the  same  channel  as  possible  and  compute  the  beamforming  weights  for  each  selected  user 
subject  to  the  SINR  constraint.  This  helps  maximize  the  (instant)  throughput  of  the  network. 
This  problem  is  also  studied  with  other  multi-user  access  schemes  such  as  TDMA,  OFDM  and 
CDMA  [5],  All  of  these  studies,  however,  assume  that  users  are  modeled  as  infinite  sources 
where  there  is  always  a  packet  to  be  served  at  the  queue  for  each  user.  Major  drawbacks  of 
these  works  are  the  limitation  of  the  focus  on  instant  total  throughput  maximization  and  the  lack 
of  consideration  of  upper  layer  QoS  requirements  for  individual  users.  Thus,  the  assignment  of 
users  on  a  channel  only  reflects  the  feasibility  at  the  physical  layer,  but  not  the  current  buffer 
occupancy  or  traffic  demand  of  each  user.  This  separation  of  physical  layer  algorithms  and  upper 
layer  QoS  requirements  leads  to  degradation  of  overall  system  performance. 

In  this  paper  we  study  the  problem  of  designing  a  scheduling  algorithm  with  BSs  that  are 
equipped  with  an  antenna  array.  We  first  consider  the  case  where  a  central  controller  handles 
multiple  BSs  serving  a  set  of  users.  In  this  case  packets  arrive  at  the  central  controller  for 
transmission  to  mobile  users.  We  model  the  system  as  a  queueing  system  with  multiple  parallel 
servers,  and  the  physical  layer  constraints  are  imposed  on  the  selection  of  users  that  can  be 
served  in  each  timeslot.  Instead  of  a  policy  that  maximizes  instant  throughput,  we  identify  an 
optimal  scheduling  policy  that  stabilizes  the  system  if  it  is  stable  under  some  policy.  The  user 
throughput  requirements  are  satisfied  under  this  optimal  scheduling  policy  and,  hence,  the  long 
term  total  system  throughput  is  maximized. 

Similar  queueing  systems  have  been  used  to  model  other  scenarios  in  [1],  [6],  [10],  [19],  [20], 
and  were  first  proposed  in  [19]  for  a  multi-hop  radio  network.  They  derive  an  optimal  policy 
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of  similar  form  (based  on  maximum  weighted  queue  size).  However,  the  complexity  of  these 
optimal  scheduling  policies  increases  exponentially  with  the  number  of  users/queues,  and  no 
practical  sub-optimal  scheduling  policy  is  proposed.  In  this  paper,  we  follow  a  similar  approach 
as  in  [10],  and  propose  two  scheduling  policies  with  significantly  lower  complexity  that  achieve 
sub-optimal  performance  for  our  problem.  In  fact,  the  first  proposed  scheduling  algorithm  has 
linear  complexity. 

Some  of  our  preliminary  work  is  presented  in  [15].  However,  the  performance  evaluation 
reported  in  [15]  was  carried  out  using  a  simple  heuristic  beamforming  algorithm,  assuming  all 
transmissions  are  successful.  In  this  paper  we  use  an  optimal  beamforming  algorithm  proposed 
in  [17]  and  explicitly  model  the  events  of  successful  transmissions  based  on  the  achieved  SINR 
values  at  the  scheduled  users  and  link  curves.  Most  importantly,  in  [15]  we  consider  a  multiple 
cell  network  under  centralized  control  and  do  not  study  the  performance  of  our  algorithms  in 
multiple  cell  networks  where  resource  allocations  are  carried  out  by  the  BSs  in  a  distributed 
manner  without  coordination.  In  such  an  environment  the  inter-cell  interference  caused  by  other 
co-channel  cells  at  a  scheduled  user  is  unknown  in  advance.  As  a  result  the  traditional  approach 
of  aiming  to  achieve  some  target  SINR  at  the  scheduled  users,  which  is  the  same  approach  taken 
for  our  single  cell  network  study,  is  no  longer  feasible. 

Understanding  the  effects  of  the  unknown  inter-cell  interference  on  the  performance  of  the 
beamforming  and  scheduling  algorithms  is  crucial  as  many  of  the  wireless  service  providers 
are  currently  interested  in  moving  the  intelligence  from  a  central  resource  manager  (e.g.,  radio 
network  controller  (RNC))  to  the  edge  of  the  network  (e.g.,  BSs)  in  order  to  handle  the  increasing 
complexity  of  the  resource  allocation  algorithms  and  to  reduce  the  delays  in  channel  estimation 
between  the  BSs  and  the  central  resource  manager.  In  this  paper  we  adopt  a  beamforming 
algorithm  we  propose  in  [16]  (summarized  in  subsection  VI-B  in  this  paper).  This  algorithm  is 
a  modification  of  the  beamforming  algorithm  proposed  in  [17]  and  provides  guaranteed  average 
PERs  to  the  users  in  the  presence  of  unknown  inter-cell  interference.  Using  this  beamforming 
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algorithm  we  investigate  the  performance  of  our  proposed  scheduling  algorithms  and  the  degra¬ 
dation  in  network  performance  due  to  the  unknown  inter-cell  interference  in  Section  VII.  We 
also  carry  out  a  study  on  the  effects  of  the  inter-cell  interference  on  network  performance  and 
spectral  efficiency  under  different  reuse  patterns. 

This  paper  is  organized  as  follows.  In  Section  II  we  describe  the  problem  of  designing  an 
efficient  downlink  scheduling  algorithm  with  base  station  antenna  arrays,  and  derive  an  optimal 
scheduling  policy  based  on  feasible  rate  vectors.  In  Section  III  we  describe  the  single  cell  network 
model,  which  is  followed  by  our  proposed  heuristic  algorithms  that  approximate  the  optimal 
scheduling  policy  with  lower  complexity  in  Section  IV.  Simulation  results  of  the  proposed 
algorithms  in  a  single  cell  network  are  given  in  Section  V.  Section  VI  describes  a  multi¬ 
cell  network  model  and  beamforming  algorithms  that  handle  random  inter-cell  interference  in 
such  environments,  followed  by  simulation  results  of  the  proposed  algorithms  in  a  multi-cell 
environment  in  Section  VII.  We  conclude  in  Section  VIII. 

II.  Optimal  Downlink  Scheduling 

In  this  section  we  consider  the  case  where  a  centralized  control  agent  carries  out  scheduling 
and  beamforming  for  multiple  BSs  serving  a  set  of  users.  We  define  an  achievable  rate  vector  and 
a  throughput  region,  and  present  an  optimal  scheduling  algorithm  that  can  achieve  any  interior 
point  in  the  throughput  region. 

A.  System  model 

We  consider  a  wireless  network  that  consists  of  several  BSs.  Each  BS  is  equipped  with  an 
antenna  array  so  that  several  users  can  be  served  simultaneously.  These  BSs  are  coordinated  by 
a  single  central  controller.  Mobile  users  in  the  network  are  able  to  receive  data  packets  from 
any  of  these  BSs.  However,  at  any  given  time,  a  mobile  user  can  receive  data  packet(s)  from  at 
most  one  BS.  We  assume  a  time  slotted  system  where  the  transmission  time  of  a  packet  equals 
the  duration  of  a  timeslot  when  the  lowest  transmission  rate  is  selected.  In  each  timeslot,  the 
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central  controller  collects  the  information  regarding  the  wireless  channel  conditions  of  each  user 
to  different  BSs.  Based  on  this  information  and  the  number  of  backlogged  packets  of  each  user, 
the  central  controller  makes  a  scheduling  decision  for  the  timeslot.  The  scheduling  decision  made 
by  the  central  controller  includes  assignment  of  BSs  to  the  users  and  the  transmission  rate  of 
each  user,  and  the  calculation  of  the  beamforming  weights  for  the  selected  transmission  rates. 

The  system  architecture  under  study  is  depicted  in  Fig.  1.  User  packets  enter  the  scheduling 
module  at  the  central  controller,  which  determines  the  assignments  of  BSs  and  transmission  rates. 
Beamforming  and  power  adaptation  are  subsequently  carried  out  for  scheduled  users.  Scheduling 
and  beamforming  are  interdependent  operations  that  depend  on  network  state  ( i.e queue  sizes) 
and  channel  state  information,  which  is  assumed  to  be  available  at  the  central  controller. 


Fig.  1.  The  multiple  cellular  communication  system. 


B.  Problem  statement 

The  network  consists  of  I  BSs  shared  by  J  mobile  users.  We  denote  the  set  of  BSs  by 
X  =  and  the  set  of  users  by  J  =  {1,...,J}.  There  is  a  central  controller  that 
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coordinates  the  operation  of  the  I  BSs.  Each  BS  is  equipped  with  an  Af -element  antenna  array. 
Several  transmission  rates  are  available  at  the  BSs  based  on  the  channel  conditions.  The  set  of 
available  transmission  rates  is  denoted  by  V.  We  assume  that  each  transmission  rate  is  a  positive 
integer  number.  If  rate  v  G  V  is  chosen,  up  to  v  packets  can  be  transmitted  in  one  timeslot, 
depending  on  the  number  of  packets  waiting  for  transmission.  We  denote  |V|  =  V. 

Packets  arrive  at  the  central  controller  for  transmission.  The  central  controller  maintains  a 
separate  queue  for  each  user.  Let  =  1,2,  •••,./  and  t,  =  0,1,---,  denote  the  number 

of  packets  that  arrive  at  queue  j  in  timeslot  t.  We  assume  that  a,j(t),t  =  0, 1,  2, ... ,  are  given 
by  independent  and  identically  distributed  (i.i.d.)  random  variables  (rvs)  with  a  finite  second 
moment,  i.e.,  E  \a:l(t)2]  <  oo.1  The  average  arrival  rate  of  user  j ,  E  [aj{t)],  is  denoted  by  Ar 
We  call  the  vector  of  the  average  arrival  rates  of  the  users,  A  =  (Ax,  A2,  •  •  • ,  Aj)T,  an  arrival 
vector. 

We  assume  that  the  central  controller  has  perfect  channel  information  of  each  user  to  the  BSs. 
In  each  timeslot,  the  central  controller  (i)  assigns  the  BSs  to  the  users,  (ii)  computes  transmission 
rates  of  scheduled  users,  and  (iii)  calculates  the  beamforming  weights  of  the  scheduled  users. 
A  scheduling  decision  by  the  central  controller  can  be  expressed  as  an  I  x  J  matrix  R  =  [r,:)] 
where  the  element  rl3  G  V  U  {0},  i  =  1.  ■■•,/,  and  j  =  1,  •  •  • ,  J,  is  the  transmission  rate  of  BS 
i  to  user  j.  However,  a  rate  matrix  R  can  be  selected  for  transmission  by  the  central  controller 
only  if  it  satisfies  certain  physical  layer  constraints  described  below. 

For  data  traffic,  in  order  to  maintain  reasonable  performance  at  higher  layers  the  PER  at  the 
physical/link  layer  needs  to  be  kept  fairly  lower.  This  PER  requirement  demands  that  the  SINR 
at  each  receiver  be  above  some  threshold  value.  A  rate  matrix  is  feasible  if  and  only  if  (i)  each 
user  receives  packets  from  at  most  one  BS  and  (ii)  SINR  requirement  is  satisfied  for  every  user. 
Note  that  the  feasibility  of  a  rate  matrix  depends  on  the  target  PER  that  determines  the  SINR 

'The  results  in  this  paper  can  be  easily  extended  to  the  case  where  the  arrival  processes  are  given  by  an  ergodic  Markov 
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requirement  for  each  user. 

We  model  the  channel  process  for  all  users  as  a  finite  state  ergodic  Markov  chain  (MC)  with  a 
stationary  distribution  n.  Each  channel  state  is  associated  with  a  set  of  feasible  rate  matrices.  In 
other  words,  a  state  of  the  MC  specifies  the  set  of  all  feasible  rate  matrices  that  can  be  selected 
for  transmission  given  the  channel  conditions  of  the  users.  The  state  space  of  the  MC  is  denoted 
by  S.  The  problem  we  are  interested  in  is  one  of  finding  a  scheduling  policy  that  selects  a 
feasible  rate  matrix  given  the  queue  sizes  and  channel  state  in  each  timeslot,  so  that  the  system 
is  stable  whenever  possible  under  some  policy.  In  this  paper  we  only  consider  stationary  policies, 
i.e.,  the  scheduling  decisions  do  not  depend  on  timeslot  t,  but  only  on  the  channel  state  S (t)  and 
the  queue  sizes  X(t)  :=  (x\ (£), . . .  ,xj(t))T,  where  Xj(t)  is  the  number  of  backlogged  packets 
in  queue  j  at  the  beginning  of  timeslot  t.  A  stationary  scheduling  policy  can  be  viewed  as  a 
mapping  that  assigns  to  each  pair  (X,  S)  G  Z  |x5  of  queue  sizes  and  channel  state  a  probability 
distribution  on  the  set  of  feasible  rate  matrices  for  the  given  state  S,  where  Z+  :=  {0,1,...}, 
where  the  probability  assigned  to  a  feasible  rate  matrix  is  the  probability  the  policy  will  select 
the  rate  matrix  for  transmission  given  S (t)  and  X(f). 

C.  Throughput  region 

In  this  subsection  we  first  define  a  stable  arrival  vector  and  then  characterize  the  throughput 
region. 

Definition  1:  An  arrival  vector  A  is  said  to  be  stable  if  there  exists  a  scheduling  policy  such 
that 

1  f 

lim  lim  sup  -  ^  l[xj(r)>c]  =  0  ,  for  all  j  =  1,  2,  •  •  • ,  J  .  (1) 

c_>0°  t-nx>  t  T=1 

If  a  scheduling  algorithm  satisfies  (1),  then  we  say  that  A  is  stable  under  the  scheduling  policy. 
The  throughput  region,  denoted  by  A,  is  defined  to  be  the  closure  of  the  set  of  stable  arrival 
vectors. 

The  following  proposition  characterizes  the  throughput  region  A. 
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Proposition  1:  A  necessary  and  sufficient  condition  for  an  arrival  vector  A  to  belong  to  A, 
is  that  there  exists  a  scheduling  policy  that  achieves 

A  <  D  :=  ^  7Ts  csrR-T1/xi(1  —  PER )  (2) 

ses  Res 

where  csr,  S  e  *S,  R  G  S,  are  nonnegative  numbers  such  that  Ergs  csr  =  1  for  all  S  G  5. 
Proof:  A  proof  is  provided  in  Appendix  I  ■ 

D.  Optimal  scheduling  policy 

In  this  subsection  we  describe  an  optimal  scheduling  policy  that  satisfies  (1)  for  every  A  £ 
int(A),  where  int(A)  denotes  the  interior  of  the  throughput  region  A.  In  particular,  we  consider 
the  following  scheduling  policy:  Given  backlog  vector  X(f)  and  channel  state  S (/) ,  the  rate 
matrix  selected  by  the  scheduling  algorithm  is  given  by 

R (t)  =  arg  max  X(t)T(RT  l/xl)  •  (3) 

Res(t) 

Ties  are  assumed  to  be  broken  arbitrarily. 

The  following  proposition  establishes  the  (throughput)  optimality  of  the  scheduling  policy 
given  by  (3). 

Proposition  2:  Suppose  that  A  £  int(A).  Then,  the  system  is  stable  under  the  scheduling 
policy  given  by  (3). 

Proof:  A  proof  is  given  in  Appendix  II.  ■ 

In  fact,  in  the  proof  we  prove  that  any  stationary  scheduling  policy  that  selects  the  rate  matrix 
arg  maxRcS(()  X(t)T2(RT  l/xl),  where  S  =  diag(£j,  j  £  J)  and  fj  >  0,  is  a  throughput  optimal 
policy  and  the  system  is  stable  if  the  arrival  vector  A  £  int(A). 

In  this  section  we  have  considered  scenarios  where  a  centralized  controller  carries  out  the 
scheduling  and  beamforming  for  multiple  BSs  serving  a  fixed  set  of  users,  and  derived  an 
optimal  scheduling  policy.  The  derived  optimal  scheduling  policy,  however,  does  not  yield  a 


10 


practical  implementation  as  it  requires  searching  through  all  feasible  rate  matrices  given  the 
current  channel  state  and  identifying  the  one  that  maximizes  the  inner  product  given  in  (3). 

A  natural  question  that  arises  is  how  one  can  design  more  practical  scheduling  algorithms 
based  on  the  optimal  scheduling  policy.  In  the  following  sections,  we  assume  that  each  user  is 
associated  with  the  closest  BS,  i.e.,  static  BS  assignment,  and  investigate  the  issue  of  designing 
practical  scheduling  algorithms  with  an  antenna  array  at  the  BS(s).  We  will  first  consider  a  simple 
case  of  single  cell  networks  in  Sections  III  and  V,  and  then  discuss  multiple-cell  environments 
in  Sections  VI  and  VII.  In  the  case  of  a  multiple-cell  network,  a  receiving  user  experiences  inter¬ 
cell  interference  from  co-channel  cell  BSs  that  share  the  same  frequency  spectrum.  Therefore,  a 
BS  needs  to  compensate  for  this  random  inter-cell  interference  experienced  by  a  receiver  when 
computing  beamforming  weights  and  transmission  power  of  a  scheduled  user.  This  issue  will  be 
discussed  in  Section  VI. 


III.  Single  Cell  Network  Model 

In  this  section  we  first  describe  the  channel  models  used  in  our  study  and  then  present  the 
optimal  beamforming  algorithms  to  be  used  in  our  study  of  single  cell  networks. 

A.  Single  cell  channel  model 

In  this  subsection  we  describe  the  wireless  channel  model  for  a  single  cell  network  with  a  cell 
radius  of  R.  A  BS  is  located  at  the  center  of  the  cell  and  transmits  packets  to  N  users  uniformly 
distributed  in  the  cell.  In  each  timeslot,  the  BS  schedules  a  set  of  users  for  transmission,  and 
calculates  their  beamforming  weights  and  transmission  powers. 

The  BS  is  equipped  with  an  antenna  array,  where  M  antenna  elements  are  uniformly  located 
on  a  circle  of  radius  r.  The  multi-path  channel  between  a  given  user  and  the  m-th  antenna 
element  of  the  BS  is  expressed  as  hm(t )  =  Y.'-\  9t  S  (t  —  t?  —  t["),  where  L  is  the  number  of 
paths,  gt  is  the  complex  gain  of  the  /;-th  path,  and  t>  is  the  delay  of  the  f-th  path  with  respect 
to  the  first  antenna  element  with  m  =  1.  The  gain  gt  is  a  complex  rv  with  zero  mean  and 
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variance  A(i.  The  term  rf  =  (r/c) (cos ^  —  cos(27r(m  —  1  )/M  —  Of})  captures  the  delay  to  the 
m-th  antenna  with  respect  to  the  first  antenna,  and  Of  is  the  angle  of  arrival  of  the  f-th  path  of 
the  user,  and  c  is  the  electro-magnetic  wave  propagation  speed.  We  assume  each  path  results 
from  the  reflection  by  a  scatterer,  and  L  scatterers  are  distributed  within  a  circle  of  radius  R1 
centered  at  the  user  and  are  uniformly  distributed  both  in  distance  and  in  angle  with  reference 
to  the  user. 

The  signal  received  by  the  user  from  the  BS  is  given  by 

M  L 

y{t)  =  V?E  w™  5Z  9e  eJUJT™s{t  -  re)  , 

m= 1  1=1 


where  p  is  the  transmission  power,  wm  is  the  beamforming  weight  of  the  m-th  antenna  element, 
and  s(t)  is  the  signal.  Note  that  we  implicitly  assume  that  77  t["  and  the  sum  n  +  t?  n- 

Beamforming  vector  w  =  [w1,  w2,  ■  ■  ■ ,  wM]T  satisfies  w/Jw  =  1.  The  M  x  1  antenna  steering 
vector  v(0f)  at  direction  Of  is  defined  to  be  [e-7“T™;  m  =  1 M],  where  is  the  carrier 
frequency.  If  we  assume  that  all  paths  are  independent,  the  expected  received  signal  power  is 
given  by 


E 


M  L 

ej^ms(t-r£) 

m=  1  £=1 


=  pwHRw  , 


where  R  =  YJt=\  Aev(9e)vH (9e).  This  is  because  E  i/|2j  equals  A(  if  ix  =  l2  and  0  otherwise 
from  the  assumption  E  [fp]  =  0.  The  matrix  R  is  called  a  spatial  covariance  matrix  and  in  general 
has  a  rank  larger  than  one. 

For  time  varying  channels,  the  variance  of  the  channel  gain  {Ae(t)]  t  =  1,  2, . . .}  is  a  stochastic 
process.  We  consider  temporally  correlated  shadow  fading  plus  Rayleigh  fading  channel  model 
in  this  paper.  The  rv  Af(t)  =  (sl(t)ff  (i))-/d>f  where  sf  (i)  and  ff(1)  are  sequences  of  log-normal 
and  Rayleigh  rvs,  respectively,  accounting  for  slow  shadow  fading  and  fast  fading.  The  variable 
df  denotes  the  distance  from  the  BS  to  the  user  along  the  f-th  path  and  k  is  the  path  loss 
exponent.  The  sequence  of  rvs  { ( t ) }  =  {e'7^},  where  {rf(f)}  is  a  sequence  of  Gaussian  rvs 
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generated  from  re(t  +  1)  =  (1  —  p)rg(t )  +  p  ■  ut(t)  using  i.i.d.  Gaussian  rvs  {ut(t)}  with  zero 
mean,  and  is  a  sequence  of  i.i.d.  Rayleigh  rvs. 

We  denote  the  spatial  covariance  matrix  of  user  j  as  'Hj.  The  SINR  of  user  j,  denoted  by 
SINRj,  is  given  by  Sj/(  +  n2),  where  Sj  and  are  the  signal  power  and  intra-cell 
interference  received  by  user  j,  respectively,  and  n2  is  the  noise  power  at  user  j.  Let  J\f  be  the 
set  of  co-cell  users.  Then,  the  signal  power  Sj  and  the  intra-cell  interference  /'"/r"  are  given  by 

Sj  =  Pj  (wf  • Hjvrj)  and  lftra  =  ^  pk  (wf  ^-wfc)  , 

keAf 

kdj 

where  pj  and  w?  are  transmission  power  and  beamforming  vector,  respectively,  for  user  j  £  J\f. 


B.  Optimal  beamforming  algorithms 

In  this  subsection  we  briefly  summarize  a  downlink  beamforming  algorithm  proposed  in  [17]. 
This  algorithm  equalizes  the  ratios  of  the  achieved  SINRs  to  some  target  SINRs  (called  relative 
SINRs)  in  a  single  cell  under  the  assumption  that  the  noise  power  at  each  user  is  constant 
and  is  available  to  the  BS.  It  consists  of  two  phases;  in  the  first  phase,  the  minimum  relative 
SINR  among  the  scheduled  users  is  maximized  (Algorithm  I).  This  is  equivalent  to  finding  the 
largest  common  relative  SINR  p*  under  a  power  budget  constraint  ||p||i  =  Pmnx,  where  p  is  the 
transmission  power  vector.  Here  ||  •  ||i  denotes  L 1  norm.  Let  W  =  { w; , ;j  £  U  }  be  the  ensemble 
of  beamforming  vectors,  where  U  is  the  set  of  scheduled  users  and  \U\  =  U,  and  7 j  denotes  the 
target  SINR  of  user  j.  A  set  of  users  can  be  scheduled  with  their  respective  SINR  requirement 
satisfied  if  tfr  >  1,  and  a  set  of  users  that  satisfies  this  condition  is  called  a  feasible  set. 


Algorithm  I:  FEASiBiLiTY(Pm<la:;  • ,  Vj  e  U) 

STEP  1:  Set  n  =  0,  q(0>  =  [0,  •  •  • ,  0]T,  and  A™*  =  oo. 

STEP  2:  While  1,  do 

•  Set  n  <—  n  +  1.  Solve  a  set  of  U  generalized  eigenproblems: 


(n) 

wj.  =  arg 


1  =  1  wJ 


'N-'.<rrt  1  -iwj 


Vjeu 


(4) 
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where  'Hj  =  Hj  /nj.  and  Rj(qO  1')  =  ^^keu  k^j  ^  +  /  with  /  being  the  M  X  M  identity  matrix.  The  solutions  to 

the  above  generalized  eigenproblems  are  given  by  the  dominant  generalized  eigenvectors  of  the  matrix  pairs  fij,  R  f  ( q1" ~ '  1 )]  for  all 

i  c  u. 

Find  the  largest  eigenvalue  A of  A fW'  r' f ,  Pmax)  and  the  corresponding  eigenvector  q[”[  of  the  form  (iff)  =  [qAl ;  1],  where 


A("W,  PmaJl  — 


D(W)®T(W)  D(W)cr 

7v^j1tD(W)'I't(W)  ■p^jiTD(W)o- 

I  =  [1  •  •  •  1]T,  D(W)  =  diag{7i/(wf  %wi),  ■  ■  ^yu/(w§ 'Hu™u)}-  .a  =  •  ,r»y]T,  ’E(W)  =  G  U],  and  the 

interference  caused  by  user  j  to  user  i  per  unit  power  t/{j  =  w(  7A,  w  ,.  if  i  j  and  ipjj  =  0  if  i  =  j. 

Tf  _  \(")  <"  p  hrpak 

II  ^ max  * max  _  DiedK. 


STEP  3:  If  A! 


(n) 


<  1,  i.e.,  the  set  of  users  can  be  scheduled  in  the  same  timeslot,  output  1.  Otherwise,  output  0. 


It  is  shown  [17]  that  the  sequence  of  eigenvalues  { }  is  monotonically  decreasing  and 
converges  to  the  global  minimum  \*max,  which  is  related  to  the  maximum  common  SINR  ratio 
Vc  by  the  relation  rfc  =  ( nmi  A(  W.  Pmax)))  1  =  (A^ax)_1. 

The  second  phase  of  the  algorithm  attempts  to  minimize  the  total  transmission  power  subject 
to  SINR  requirement  given  a  feasible  set  (Algorithm  II). 

Algorithm  II:  MiNiMizE_powER(Pm<13.;  Vj  G  U) 

STEP  1:  Set  n  =  0,  and  q(°)  =  [0,  •  •  • ,  0]T. 

STEP  2:  While  1,  do 

•  Set  n  <—  n  +  1.  Solve  a  set  of  U  generalized  eigenproblems  defined  in  (4). 

•  Compute  qO>  =  (I  -  D(W(',A^,T(W(,,A)_1D(W(n))l- 

•  If  ||q("A)||i  —  ||qW||i  <  £,  break. 

STEP  3:  Compute  the  optimal  downlink  transmission  power  vector  given  by  pfO  =  (I  —  D(W^"A^,(W^n)))_1D(Wf"')l.. 

STEP  4:  Output  p(«),  W(n\ 


IV.  Proposed  Algorithms 

If  the  user  channels  are  time  invariant,  i.e.,  constant,  the  optimal  policy  described  in  Section  II- 
D  can  be  adopted.  In  other  words,  each  BS  may  be  able  to  exhaustively  search  through  all  possible 
transmission  rate  vectors  off-line,  and  select  the  solution  to  (3)  in  each  timeslot.  However,  when 
channels  vary  with  time,  this  exhaustive  search  becomes  too  computationally  expensive  and 
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impractical,  if  not  impossible.  This  is  because  the  number  of  possible  rate  vectors  is  given  by 

c0  £('j)r/  I-  (5) 

which  increases  exponentially  with  the  number  of  co-cell  users  N.  Hence,  we  turn  to  the  problem 
of  designing  heuristic  algorithms  that  will  perform  well  and  demand  much  lower  computational 
requirement. 

A.  First  Proposed  Algorithm 

Although  the  optimal  policy  in  (3)  does  not  lead  to  a  practical  algorithm,  it  suggests  that  a  good 
policy  should  attempt  to  give  higher  priority  to  users  with  larger  queue  sizes.  This  observation 
is  intuitive  in  the  sense  that  in  order  to  maintain  the  stability  of  the  system,  there  should  be 
a  balance  between  (i)  maximizing  instantaneous  system  throughput  and  (ii)  keeping  the  queue 
sizes  from  growing  without  a  bound.  Therefore,  the  optimal  policy  considers  the  inner  product 
of  two  vectors,  namely  X(i)  and  RT1/Xi,  where  the  first  term  is  the  queue  size  vector  and  the 
latter  represents  the  transmission  rate  of  each  user. 

A  heuristic  algorithm  that  attempts  to  mimic  the  behavior  of  the  optimal  policy  can  order 
the  users  based  on  either  (i)  the  transmission  rates  they  can  achieve  given  the  current  channel 
state  or  (ii)  queue  sizes.  The  first  approach  is  problematic  as  the  achievable  transmission  rates 
of  the  users  depend  on  the  set  of  scheduled  users  and  it  requires  searching  through  all  possible 
feasible  rate  vectors.  Hence,  in  our  first  heuristic  algorithm  we  order  the  users  according  to 
their  queue  sizes  and  give  higher  priority  to  users  with  a  larger  queue.  More  specifically,  the 
algorithm  starts  with  the  user  with  the  longest  queue,  and  tries  to  schedule  the  users  sequentially 
in  decreasing  order  of  their  queue  lengths.  Each  new  user  is  allocated  the  highest  possible  rate 
such  that  the  SINR  requirement  is  satisfied  with  the  new  rate  vector.  However,  when  we  insert 
users  into  the  channel  sequentially  according  to  their  queue  lengths,  it  is  possible  that  a  user 
already  scheduled  for  transmission  prevents  a  number  of  other  users  from  accessing  the  channel 
because  the  necessary  spatial  separability  cannot  be  provided.  Therefore,  in  order  to  improve 
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the  performance  of  the  system  further  and  maintain  linear  complexity,  we  will  consider  several 
candidate  rate  vectors  and  select  the  one  that  maximizes  (3).  More  specifically,  we  will  consider 
P  rate  vectors  out  of  all  possible  rate  vectors.  Clearly,  this  subset  of  candidate  rate  vectors  should 
consist  of  the  rate  vectors  that  are  more  likely  to  maximize  (3). 

We  explain  how  we  generate  this  subset  of  candidate  rate  vectors  to  be  considered.  Suppose 
that  we  form  an  ordered  list  of  users  by  decreasing  queue  size.  In  order  to  generate  the  p-th 
candidate  rate  vector,  p  =  1.  .. ...  P,  of  the  subset,  we  first  move  the  p-th  user  in  the  list  to  the 
head  of  the  list.  Then,  starting  from  the  head  of  the  list,  go  down  the  list  sequentially  and  insert 
one  user  at  a  time  using  the  largest  rate  that  is  allowed  while  maintaining  the  rates  and  required 
SINR  values  of  the  previously  scheduled  users.  Note  that  in  some  cases,  a  user  may  need  to  be 
skipped  because  the  user  may  not  be  compatible  with  other  users  already  scheduled.  Once  the  P 
candidate  rate  vectors  are  generated,  out  of  these  rate  vectors  we  select  the  one  that  maximizes 
(3).  The  pseudo-code  of  this  algorithm  is  provided  below. 


Algorithm  III:  heuristic_1(  xj(t),  Vj  e  A f) 

STEP  1:  Initialize  7Z  =  0. 

STEP  2:  For  p  =  1  to  P,  do 

•  Form  a  list  K  of  users  as  follows:  Insert  the  user  with  the  p- th  largest  queue  size  at  the  head  of  the  list,  and  insert  the  remaining  users 
by  decreasing  queue  size. 

•  Initialize  the  rate  vector  r  =  _0  and  the  set  of  scheduled  users  IA  =  0. 

•  While  \K\  ^  0,  do 

o  flag  =  0. 

o  Schedule  the  user  at  the  head  of  the  list,  denoted  by  f ,  K.  =  K,  \  {j*},U  =  U  U  {j*}  and  Vi  =  V 
o  While  Vi  7^  0  do 

*  vm  =  max{Vi },  rj*  =  %,  Vi  =  Vi  \  {vm}. 

*  If  FEASIBILITY(Pmajc;  Vj  G  W)  =  1,  flag  =  1  where  'yj  is  the  target  SINR  for  Wj  E  U,  break; 

o  If  flag  =  0,  rj*  =  0  and  U  =  U\  {j*}. 

•  71  =  7Z  Ur. 

STEP  3:  Among  the  rate  vectors  in  7Z,  select  r0  =  argmaxr^7^  i 
STEP  4:  MINlMIZE_POWER(Pmaa;;/Hj,n|,7j,Vj  E  U) 
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The  complexity  of  HEURISTIC_1  scheduling  algorithm  is  O(PNV),  and  hence  it  increases 
linearly  with  both  the  number  of  candidate  rate  vectors  P  and  the  number  of  users  N. 

B.  Second  Proposed  Algorithm 

The  optimal  scheduling  policy  (3)  is  not  suitable  for  implementation  because  its  complexity 
increases  exponentially  with  the  number  of  users  as  mentioned  earlier  (see  eq.  (5)).  However,  in 
order  to  prevent  the  complexity  from  increasing  with  the  number  of  users,  in  each  timeslot  an 
algorithm  can  first  choose  a  subset  of  a  fixed  number  of  users  to  be  considered  for  scheduling 
in  the  timeslot  and  then  carry  out  the  exhaustive  search  in  (3)  on  the  selected  users.  In  other 
words,  the  algorithm  will  mimic  the  behavior  of  the  optimal  policy  on  a  smaller  set  of  users 
that  are  selected  and  do  not  consider  the  remaining  users  for  scheduling  in  the  timeslot.  To  be 
consistent  with  the  observation  that  a  higher  priority  should  be  given  to  users  with  larger  queue 
sizes,  we  select  K  users  with  largest  queue  lengths.  Then,  an  exhaustive  search  is  conducted 
for  all  the  feasible  rate  vectors  on  this  subset  of  users,  and  the  rate  vector  that  maximizes  (3)  is 
selected.  A  pseudo-code  of  the  proposed  algorithm  is  provided  below. 


Algorithm  IV:  heuristic_2(  xj(t),  Vj  e  A f) 

STEP  1:  Initialize  R  =  0. 

STEP  2:  Select  the  K  users  with  largest  queue  lengths.  We  denote  this  set  of  users  as  K. 
STEP  3:  Let  S  —  {r  :  rj  E  V  U  {0}  if  j  E  K,  rj  =  0  if  j  K,  }. 

STEP  4:  For  each  r  E  S  do 

•  U  =  {j  ■  j  E  K.,rj  >  0}. 

•  If  FEASlBlLlTY(Pmol.;  7j ,  Vj  6U)  =  l,R=RUr. 

STEP  5:  Among  the  rate  vectors  in  1Z,  select  r0  =  argmaxrg7j  P.fi,  rjxjd)- 

STEP  6:  MINIMIZEJ>OWER(Pma3.;'H.j,n|,7j,Vj  E  U) 


The  complexity  of  HEURISTIC_2  algorithm  is  0(Ydk=i  ( l )  Vk )  =  0((V  +  1)A),  and  thus  the 


complexity  of  the  algorithm  increases  exponentially  with  the  size  of  the  subset  K . 
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V.  Performance  evaluation  of  a  single-cell  network 

In  this  section,  we  evaluate  the  performance  of  the  proposed  heuristic  scheduling  algorithms  in 
a  single  cell  network,  using  simulations.  We  first  describe  the  simulation  setup,  and  then  present 
the  numerical  results. 

A.  Simulation  setup 

1)  Wireless  channel  model:  The  parameters  used  in  the  simulation  for  modeling  wireless 
channels  are  listed  in  Table  I,  where  A  is  the  wavelength  of  the  carrier  electro-magnetic  wave. 


Pmax 

1015 

PER. target 

2% 

N 

10 

M 

4 

r 

A 

R 

1000 

R! 

100 

L 

6 

K 

3.5 

P 

0.1 

E  [«/(£)] 

0 

E[rfit)2 3]  -EM<)]2 

1.07 

E  [Mtf] 

1 

B[ft(t)2]  —  E[/,(I)]2 

1 

TABLE  I 

Parameters  used  in  performance  evaluation  of  scheduling  algorithms 


2)  Transmission  rate:  We  assume  that  the  BS  can  transmit  packets  to  a  user  using  either  a 
low  transmission  rate  or  a  high  transmission  rate.  When  a  low  (resp.  high)  transmission  rate  is 
used,  one  (resp.  two)  packet  is  transmitted  in  a  timeslot.  The  adopted  link  curves  for  both  the 
low  and  high  transmission  rates  are  shown  in  Fig.  2.  These  link  curves  give  the  probability  of 
a  packet  transmission  resulting  in  error  as  a  function  of  achieved  SINR,  i.e.,  PER. 

3)  Traffic  load:  In  our  experiment,  we  generate  an  N  x  1  random  vector  a.  The  arrival  rate 
vector  A  =  s  ■  a  where  s  is  a  parameter  used  to  scale  the  arrival  rate.  Traffic  load  is  defined  as 
1 1 A| 1 1  =  Y.'i-i  A/.  Since  it  is  difficult  to  characterize  the  throughput  region  using  simulations, 
instead  we  observe  the  average  delay  experienced  by  the  packets  with  increasing  traffic  load. 
Typically  when  the  system  loses  stability  there  is  a  sharp  increase  in  the  average  delay  at  some 
point  (called  the  knee)  due  to  the  instability. 
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Fig.  2.  The  linkcurves  for  low  and  high  transmission  rates 


B.  Numerical  results 
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Fig.  3.  Average  packet  delay  vs.  traffic  load  for  HEURISTICLl  algorithm  for  a  single  cell 


In  Fig.  3  we  show  the  performance  of  HEURISTIC- 1  algorithm  for  different  values  of  parameter 
P.  Note  that  the  average  delay  experienced  by  packets  is  similar  for  P  =  1  and  2.  This  is  because 
users  are  more  spatially  separable  in  a  single  cell  network  when  there  is  no  inter-cell  interference, 
and  most  of  the  times  first  two  users  with  the  largest  queues  can  be  scheduled  with  the  high 
transmission  rate  (2  packets).  This  can  be  seen  from  Fig.  3  since  the  knee  lies  right  around  traffic 
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load  of  6  (larger  than  4  =  2  users  x  2  packets).  Hence,  changing  the  order  of  the  first  two  users 
with  the  largest  queue  sizes  yields  similar  rate  vectors  and  does  little  to  increase  the  number 
of  distinct  candidate  rate  vectors.  As  a  result  increasing  the  number  of  candidate  rate  vectors 
from  1  to  2  does  little  to  reduce  the  average  delay.  However,  when  P  is  further  increased  to  3, 
the  maximum  stable  traffic  load  (or  the  knee)  increases.  This  is  due  to  the  fact  that  the  subset 
of  candidate  rate  vectors  expands  when  we  first  schedule  the  user  with  the  third  largest  queue, 
which  otherwise  may  not  be  scheduled  with  high  transmission  rate  (2  packets)  when  the  users 
are  considered  in  the  order  of  their  queue  sizes.  This  can  be  inferred  from  the  fact  that  with 
P  =  1  or  2  the  knee  lies  slightly  to  the  left  of  6  in  Fig.  3,  indicating  that  the  third  user  may  not 
always  be  scheduled  with  high  transmission  rate,  especially  when  a  fourth  user  is  also  scheduled. 
Hence,  this  introduces  an  opportunity  to  generate  a  candidate  rate  vector  quite  different  from 
those  generated  by  considering  only  the  first  two  users  with  high  transmission  rate.  For  a  similar 
reason,  the  maximum  stable  traffic  load  increases  with  P  =  4,  5,  and  6  (compared  to  P  =  3). 
When  the  value  of  P  is  increased  beyond  6,  the  average  delay  (and  hence  the  maximum  traffic 
load  that  can  be  handled  without  losing  stability)  does  not  change  much.  We  suspect  that  this 
is  due  to  the  fact  that  the  additional  candidate  rate  vectors  generated  schedule  a  user  with  a 
much  smaller  queue  size  and  hence  the  inner  products  in  (3)  of  these  additional  candidate  rate 
vectors  are  smaller  than  those  of  previously  generated  candidate  rate  vectors  that  schedule  users 
with  larger  queue  sizes.  The  maximum  stable  traffic  load  of  HEURISTIC _1  scheduling  algorithm 
with  P  =  1  (resp.  P  =  6)  is  approximately  91  percent  (resp.  96  percent)  of  that  of  the  optimal 
scheduling  algorithm  as  shown  in  the  figure.2 

For  the  purpose  of  comparison,  we  also  evaluate  the  performance  of  two  other  algorithms 
that  do  not  use  the  queue  length  information.  The  first  algorithm,  called  HEURISTIC  _3,  assigns 
a  credit  Cj  to  each  user,  for  example,  based  on  the  class  of  service  requested  by  the  user,  and 

"Although  the  plotted  average  delay  of  the  optimal  policy  does  not  increase  significantly  after  the  load  of  6.7,  this  is  due  to 
the  limited  simulation  run,  and  increasing  the  simulation  duration  results  in  much  larger  delays. 
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keeps  track  of  the  throughput  Tj(t)  of  the  users.  In  each  timeslot  the  users  are  scheduled  in 
increasing  order  of  their  ratios  of  throughput  to  credit  Tj(t)/cr  Since  the  user  with  the  smallest 
normalized  throughput  Tj(t)/cj  is  selected  for  scheduling  in  each  iteration,  the  algorithm  neither 
takes  the  queue  sizes  into  account  nor  attempts  to  maximize  the  instant  system  throughput.  In 
the  simulation  the  credits  of  the  users  are  set  to  their  arrival  rates. 

The  second  algorithm,  called  HEURISTIC  _4,  only  considers  the  physical  layer  separability 
of  the  users  and  attempts  to  maximize  the  instantaneous  throughput  of  the  system  in  each 
timeslot  without  considering  the  queue  sizes  [5],  Users  are  scheduled  in  the  following  sequential 
manner:  In  each  iteration,  starting  with  the  highest  transmission  rate,  for  each  unscheduled 
user  we  compute  the  common  relative  SINR  of  the  already  scheduled  users  and  the  user  under 
consideration  (using  Algorithm  I).  If  there  is  at  least  one  user  that  can  be  scheduled  at  the  selected 
rate,  we  select  the  user  that  maximizes  the  common  relative  SINR.  Otherwise,  we  reduce  the 
transmission  rate  and  repeat  the  procedure.  If  no  user  can  be  scheduled,  we  stop.  The  basic 
idea  behind  this  algorithm  is  to  schedule  a  user  that  is  most  likely  to  allow  more  users  to  be 
scheduled  in  subsequent  iterations. 

The  complexity  of  HEURISTIC_4  algorithm  is  O(MNV)  when  the  algorithm  is  terminated  after 
scheduling  at  most  M  users.  However,  in  practice  the  computational  requirement  of  HEURISTIC _4 
algorithm  is  much  higher  than  that  of  HEURISTIC  _1  algorithm.  This  is  due  to  the  fact  that  unlike 
HEURISTIC _4,  HEURISTIC _1  algorithm  typically  does  not  need  to  search  through  all  eligible 
users  for  generating  a  candidate  vector  and  searching  through  4-5  users  (instead  of  N)  is  often 
sufficient.  Hence,  when  N  is  large,  HEURISTIC- 1  algorithm  requires  much  lower  computation. 

The  average  packet  delay  under  these  algorithms  is  plotted  in  Fig.  3  as  well.  As  one  can  see 
from  the  plot,  the  maximum  traffic  load  that  can  be  accommodated  under  HEURISTIC _3  algorithm 
is  only  about  55  percent  of  that  of  the  optimal  scheduling  policy,  which  is  considerably  smaller 
than  that  of  HEURISTIC- 1  algorithm.  The  performance  of  HEURISTIC _4  is  better  than  that  of 
HEURISTIC_3,  but  is  considerably  worse  than  that  of  HEURISTIC _1  algorithm  with  P  =  4  despite 
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its  higher  computational  complexity.  The  poor  performance  of  these  algorithms  is  due  to  the  fact 
that  the  current  queue  lengths  of  the  users  are  not  considered  for  scheduling. 


Fig.  4.  Average  packet  delay  vs.  traffic  load  for  HEURISTIC_2  algorithm  for  a  single  cell 


We  also  evaluate  the  performance  of  HEURISTIC_2  algorithm,  which  is  plotted  in  Fig.  4.  As 
expected,  the  maximum  stable  traffic  load  increases  with  K,  the  size  of  the  subset  of  users 
considered  for  scheduling  in  each  timeslot.  Since  there  are  10  users  in  the  system,  the  algorithm 
with  K  =  10  is  the  optimal  policy  because  all  feasible  rate  vectors  formed  by  all  users  are 
considered.  The  maximum  stable  traffic  load  for  K  =  4  is  about  87  percent  of  that  of  the 
optimal  scheduling  policy  in  this  example. 

One  thing  to  keep  in  mind  is  that  since  HEURISTIC  _2  algorithm  considers  all  possible  rate 
vectors  with  a  subset  of  K  users,  the  complexity  of  the  algorithm  even  for  K  =  5  is  higher  than 
that  of  HEURISTIC- 1  algorithm  with  P  =  6.  Therefore,  these  simulation  results  of  a  single  cell 
network  indicate  that  HEURISTIC_1  algorithm  may  be  preferable  to  HEURISTIC _2  algorithm  as 
it  outperforms  HEURISTIC _2  algorithm  with  lower  complexity. 

VI.  Multiple  cells  network  model 

In  the  previous  sections  we  have  considered  a  simple  case  where  there  is  only  one  cell  in 
a  network.  In  practice,  in  order  to  improve  the  spectral  efficiency  of  the  cellular  system,  the 
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available  spectrum  is  reused  in  multiple  cells,  which  are  called  co-channel  cells.  Since  the  same 
frequency  band  is  used  in  more  than  one  cell,  these  co-channel  cells  cause  inter-cell  interference 
at  the  users  in  other  cells.  Therefore,  for  a  thorough  evaluation  of  the  performance  of  our  proposed 
algorithms,  we  need  to  evaluate  their  performance  in  the  presence  of  inter-cell  interference  that 
reduces  the  spatial  separability  of  the  users  as  will  be  shown  shortly. 

In  this  section  we  first  describe  the  network  model  with  multiple  cells  that  we  use  for  our 
study  (subsection  VTA),  and  then  introduce  a  beamforming  algorithm  we  proposed  in  [16]  for 
computing  the  beamforming  weights  and  transmission  powers  of  scheduled  users  in  the  presence 
of  random  inter-cell  interference  (subsection  VI-B). 

A.  Multi-cell  channel  model 

We  consider  a  network  consisting  of  7  co-channel  cells  shown  as  the  shaded  cells  in  Fig.  5.  The 
co-channel  cells  of  a  cell  can  be  found  by  (1)  moving  x  cells  along  any  chain  of  hexagons,  (2) 
turning  60  degrees  counter-clockwise,  and  (3)  moving  y  cells  ([13,  p.  28]).  A  total  of  x2+x-y+y2 
cells  share  the  available  spectrum.  We  call  this  pair  (x,  y )  the  reuse  pattern.  The  radius  of  a  cell 
is  denoted  by  R.  One  BS  is  located  at  the  center  of  a  cell. 


Fig.  5.  Co-channel  cells  (x  =  2  and  y  =  1). 


The  channel  model  we  adopt  for  a  user  in  a  cell  to  each  BS  (including  co-channel  cell 
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BSs)  is  the  same  as  the  channel  model  used  in  the  single  cell  scenarios,  which  is  described 
in  subsection  III- A.  We  denote  the  spatial  covariance  matrix  of  user  j  with  respect  to  BS  b  as 
'Hbj  and  the  BS  assigned  to  user  j  as  bj.  The  SINR  of  user  j,  denoted  by  SINRj,  is  given  by 
Sj/(Ijntra  |  +  r ij),  where  Sj,  7jntra,  and  IJvter  are  the  signal  power,  intra-cell  interference, 

and  inter-cell  interference  received  by  user  j,  respectively,  and  nj  is  the  noise  power  at  user  j. 
Let  W  be  the  set  of  users  in  the  7  co-channel  cells  we  consider.  For  each  user  j , 

Sj  =  Pj  ,  If  tra  =  Y  Pk  (wf ' ,  and  =  Y  Pk  (wf 

keu*  keu* 

k^j,bk=bj  k^j,bk^bj 

where  p,  and  wy  are  the  transmission  power  and  beamforming  vector,  respectively,  for  user 

3  e  U\ 

As  mentioned  at  the  beginning  of  this  section,  in  a  multi-cell  network  the  inter-cell  interference 
experienced  by  a  user  depends  on  many  factors,  including  the  scheduling  and  beamforming 
algorithms  used  in  the  co-channel  cells.  In  order  to  provide  satisfactory  physical  layer  QoS 
to  the  scheduled  users,  the  beamforming  algorithms  need  to  account  for  the  random  inter-cell 
interference  that  will  be  experienced  by  scheduled  users.  In  this  study  we  adopt  a  beamforming 
algorithm  proposed  in  [16]  that  is  based  on  a  closed-form  approximation  of  PER  as  a  function 
of  SINR  to  handle  the  issue  of  random  inter-cell  interference  experienced  by  the  users.  This 
algorithm  is  summarized  in  the  following  subsection. 

B.  Beamforming  algorithm  for  multi-cell  networks 

We  observe  that  most  of  the  link  curves  ( e.g [8],  [12])  can  be  fitted  using  a  function  of  the 
form 

PER(SINR)  =  1/(1  +  ek{SINRdB-z))  (6) 

where  SINRdB  is  the  SINR  in  dB,  i.e.,  SINRdB  =  101og10 (SINR),  and  k  and  z  are  two 
fitting  parameters  that  determine  the  slope  and  the  position  of  a  link  curve,  respectively. 
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Fig.  6.  Link  curves  of  a  TDMA  system. 


The  measured/collected  data  points  for  different  modulation  and/or  coding  schemes  of  a 
TDMA  system  [8]  are  plotted  as  V  in  Fig.  6,  and  the  fitting  curves  are  shown  as  solid  curves. 
One  can  clearly  see  that  these  fitting  curves  match  the  measured  link  data  very  closely.  Link 
curves  for  systems  other  than  this  TDMA  system  are  similar  in  shape  but  with  different  slopes. 
Several  example  link  curves  for  a  CDMA  system  are  given  in  [12].  The  slope  of  a  link  curve 
reflects  the  sensitivity  of  PER  to  SINR  value  and  is  determined  by  the  modulation  and/or  coding 
scheme,  packet  length,  etc. 

For  a  reasonable  target  PER  (less  than  5-10  percent),  we  can  approximate  (6)  as  follows: 

PER(SINR)  =  1/(1  +  ek(SINRdB-z >) 

«  e~KSINRdB-z)  =  ekzSINR~c *  (7) 

where  a  =  10k/  In  10. 

The  following  beamforming  algorithm  is  based  on  the  approximation  in  (7)  and  attempts  to 
adjust  the  virtual  target  PER  provided  to  the  beamforming  algorithm  (which  may  be  different 
from  the  desired  target  PER)  in  such  a  way  that  the  achieved  PER  equals  the  desired  target 
PER.  We  refer  the  interested  readers  to  [16]  for  details  of  the  algorithm.  The  algorithm  consists 
of  two  subalgorithms,  feasibility _M  and  minimize _power_m.  Algorithm  feasibility _m 
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tests  the  feasibility  of  a  set  of  users  with  their  respective  transmission  rates,  and  algorithm 
minimize _power_m  minimizes  the  total  transmission  power  for  a  feasible  rate  vector. 

Algorithm  V:  FEASiBiLrrY_M(iW* ;  'Hjz  nj  ijnUr(t),  SINRJ1,  SINRJa,  Vj  e  U) 

STEP  1:  If  FEASIBILITYf/VmA  Hj ,  n|  +  iy!‘r  r(l).  7 j .  V j  dli)  =  1,  where  7 j  satisfies  eU  V  7  ■  =  / j  •  PERfarget,  output  1,  otherwise 

output  0. 


In  the  above  algorithm  the  input  variable  IJller(t)  is  the  time  average  of  the  inter-cell  inter¬ 
ference  at  user  j,  and  SINRJ1  and  SINRja  are  the  time  average  of  SIN  IN 1  and  SINR~a, 
respectively.  The  variables  kj,  Zj,  and  ay  denote  the  parameters  of  the  link  curve  corresponding  to 

_ g _  Oij  - ~  _ 

the  transmission  rate  selected  for  user  j.  The  parameter  tj  is  defined  to  be  SINRJ 1  / SINRj  “J 

[16]. 

Algorithm  VI:  MiNiMizE_powER_M(Pmaa,;  Rj,  rij  IJter(t),  SINRJ1,  SINRJa,  Vj  e  U) 

STEP  1:  MINIMIZE_POWER(Pmoa.;  "Wj.ral  +  Ilnter(t),  yj,  Vj  £  U),  where  7 j  satisfies  ekizi^-  =  tj  ■  PERtarget- 

These  algorithms  are  modifications  of  the  beamforming  algorithms  FEASIBILITY  and  MINI- 
mize_power  described  in  subsection  III-B,  and  ensure  that  the  average  PER  equals  the  desired 
target  PER  in  a  multi-cell  network  with  inter-cell  interference. 

VII.  Performance  evaluation  of  a  multi-cell  network 

For  our  simulation  with  multiple  cells,  we  use  a  reuse  pattern  of  (x,  y )  =  (1,  1).  As  with  the 
single  cell  scenarios,  we  study  the  average  packet  delay  with  varying  traffic  load  only  in  the 
center  cell  under  the  proposed  scheduling  algorithms.  The  BSs  in  the  surrounding  cells  utilize 
HEURISTIC- 1  algorithm  with  P  =  1,  and  the  traffic  load  is  assumed  to  be  constant. 

The  performance  of  HEURISTIC_1  algorithm  is  shown  in  Fig.  7.  We  observe  that  the  maximum 
stable  traffic  load  achieved  by  the  optimal  scheduling  policy  in  (3)  for  this  multi-cell  network  is 
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about  80  percent  of  that  of  a  single  cell  network  due  to  the  presence  of  inter-cell  interference, 
which  reduces  the  spatial  separability  of  the  users. 

It  is  worth  noting  that  unlike  in  the  single  cell  network,  the  maximum  stable  traffic  load 
increases  from  P  =  1  to  P  =  2.  This  is  due  to  the  inter-cell  interference  that  makes  the  users 
less  spatially  separable.  Therefore,  switching  the  order  of  the  two  users  with  largest  queue  lengths 
often  yields  different  rate  vectors  because  two  users  with  the  largest  queue  sizes  may  not  be 
scheduled  together  any  more  due  to  the  lack  of  spatial  separability.  This  increases  the  subset  of 
candidate  rate  vectors,  and  leads  to  better  performance  and  larger  throughput  region.  Algorithm 
HEURISTIC _1  with  P  =  2  and  P  =  3  has  similar  maximum  stable  traffic  load  that  is  about  90 
percent  of  that  achieved  by  the  optimal  scheduling  policy  (3),  which  is  slightly  lower  than  in  the 
single  cell  scenario  (91  -  92.5  percent).  This  performance  degradation  of  HEURISTIC_1  scheduling 
algorithm  in  a  multi-cell  network  is  due  to  the  fact  that  the  users  are  less  spatially  separable  in 
a  multi-cell  network  in  the  presence  of  inter-cell  interference.  This  oftentimes  prevents  the  users 
with  large  queue  sizes  from  being  spatially  separable,  and  as  a  result  they  cannot  be  scheduled 
together  and  sequential  scheduling  of  the  users  based  on  their  queue  lengths  may  result  in  a  rate 
vector  not  close  to  the  optimal  one  selected  by  (3). 


Fig.  7.  Average  packet  delay  vs.  traffic  load  for  HEURISTIC.l  algorithm  for  multiple  cells 
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The  performance  of  HEURISTIC_3  and  HEURISTIC_4  algorithms  is  also  shown  in  Fig.  7  for 
comparison.  The  maximum  stable  throughput  of  HEURISTIC  _3  and  HEURISTIC  _4  is  about  55 
percent  and  78  percent  of  that  of  the  optimal  scheduling  policy  (3),  respectively,  in  this  multi-cell 
scenario.  Thus,  our  proposed  algorithm  outperforms  the  algorithm  that  attempts  to  maximize  the 
instantaneous  throughput  ( i.e .,  HEURISTIC_4)  without  considering  the  queue  sizes. 

The  performance  of  HEURISTIC_2  algorithm  with  different  values  of  parameter  K  is  displayed 
in  Fig.  8.  The  maximum  stable  traffic  load  increases  with  K  as  expected.  With  K  =  4,  the 
maximum  stable  traffic  load  is  about  73  percent  of  that  achieved  by  optimal  scheduling  policy, 
compared  to  87  percent  for  the  single  cell  network.  This  is  again  due  to  the  fact  that  users  are  less 
spatially  separable  with  inter-cell  interference.  Therefore,  considering  the  feasible  rate  vectors 
only  for  a  small  number  of  users  often  selects  a  rate  vector  that  is  not  close  to  the  optimal  one 
given  by  (3). 


Fig.  8.  Average  packet  delay  vs.  traffic  load  for  HEURISTIC_2  algorithm  for  multiple  cells 


Comparing  the  figures  for  single  cell  and  multi-cell  networks,  we  observe  that  the  average 
packet  delay  increases  more  smoothly  for  the  multi-cell  network.  This  is  because  in  a  single  cell 
network  scenario  things  are  more  deterministic  due  to  the  absence  of  inter-cell  interference,  while 
in  a  multi-cell  network  scenario  the  presence  of  time  varying  inter-cell  interference  introduces 
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more  stochastic  disturbance  and  randomness  to  system  dynamics. 


A.  Spectral  efficiency  under  different  reuse  patterns 

In  this  section  we  have  fixed  the  reuse  pattern  at  (x,y)  =  (1,1)  and  studied  the  performance 
of  our  proposed  algorithms  against  that  of  the  optimal  algorithm  as  well  as  that  of  two  other 
heuristic  algorithms.  In  this  subsection  we  are  interested  in  investigating  the  spectral  efficiency 
of  the  system  under  HEURISTIC_1  algorithm  with  different  reuse  patterns.  Changing  the  reuse 


pattern  alters  the  distance  between  BSs  and 
interference. 


Maximum  stable  traffic  load  vs.  reuse  pattern 


(a) 


Fig.  9.  Plot  of  (a)  maximum  stable  load  and  (b)  maximum 


the  characteristics  of  the  random  inter-cell 


Normalized  maximum  stable  traffic  load  vs.  reuse  pattern 


(b) 


load  normalized  by  x2  +  xy  4-  y 2  for  different  reuse  patterns 


Fig.  9  plots  the  maximum  stable  load  and  the  maximum  stable  load  normalized  by  x2  +xy  +  y2 
under  different  reuse  patterns.  Fig.  9(b)  clearly  shows  that  the  normalized  maximum  stable  load 
decreases  with  the  number  of  cells  sharing  the  spectrum,  indicating  lower  spectral  efficiency.  In 
addition,  as  the  distance  between  BSs  increases,  the  inter-cell  interference  becomes  weaker  and 
the  maximum  stable  load  increases  and  approaches  that  of  a  single  cell  network.  Further,  the 
performance  gap  between  P  =  1  and  P  =  3  narrows  with  increasing  distance  between  BSs.  This 
is  due  to  the  fact  that  as  the  inter-cell  interference  gets  weaker,  the  spatial  separability  of  the  users 
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improves  and  increasing  the  number  of  candidate  rate  vectors  from  1  to  3  does  not  significantly 
improve  the  network  performance  as  mentioned  in  Section  V.  This  result  indicates  that  in  order  to 
improve  the  spectral  efficiency  a  smaller  reuse  pattern  is  preferred  under  HEURISTIC- 1  algorithm 
when  our  proposed  beamforming  algorithm  is  adopted  to  handle  the  random  inter-cell  interference 
from  co-channel  cell  BSs. 


VIII.  Discussion 

The  use  of  antenna  arrays  at  the  base  stations  has  been  proposed  to  improve  the  system 
throughput  and  to  provide  quality-of-service  (QoS)  guarantees  to  mobile  users  in  wireless  net¬ 
works.  In  this  paper  we  studied  the  problem  of  wireless  scheduling  with  base  station  antenna 
arrays  with  a  physical  layer  constraint  of  providing  certain  packet  error  rate  and  higher  layer  QoS 
guarantees  in  the  form  of  throughput.  An  optimal  scheduling  policy  that  achieves  the  throughput 
region  is  derived. 

We  have  proposed  two  heuristic  algorithms  that  attempt  to  mimic  the  behavior  of  the  optimal 
policy  with  much  lower  complexity.  Simulation  results  suggest  that  these  algorithms  yield 
significant  performance  improvement  over  other  algorithms  that  do  not  consider  queue  state  for 
scheduling  decisions.  The  first  proposed  algorithm  is  shown  to  achieve  the  schedulable  region 
close  to  the  throughput  region  with  linear  complexity  in  the  number  of  candidate  vectors  and  the 
number  of  users,  whereas  the  complexity  of  the  optimal  policy  increases  exponentially  with  the 
number  of  users.  Furthermore,  simulation  results  indicate  that  the  number  of  candidate  vectors 
required  to  enjoy  most  of  the  benefits  is  close  to  the  number  of  antenna  elements  at  the  base 
stations,  which  could  be  orders  of  magnitude  smaller  than  the  number  of  all  feasible  rate  vectors. 
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Appendix  I 

Proof  of  Proposition  1 


In  order  to  prove  the  proposition  we  first  show  that  the  throughput  region  achieved  by  stationary 
scheduling  policies  that  consider  only  the  channel  state  S  is  given  by  the  throughput  region  stated 
in  the  proposition.  Then,  we  prove  that  restricting  the  scheduling  policies  to  such  scheduling 
policies  does  not  reduce  the  throughput  region. 

Define 


1  f 

lim  inf  -  ^  dj 

T— 1 


t—¥  OO  f 


T) 


(8) 


where  r,(£)  is  the  transmission  rate  for  user  j  in  timeslot  t,  dj(t )  =  rj(t)(  1  —  /'(/)).  r;(R) 
denotes  the  j-th  element  of  the  vector  Rt1/xi,  and  /}(t)  is  an  indicator  function 

{1,  if  user  /’s  transmission  in  timeslot  t,  is  unsuccessful 
0,  otherwise 

These  indicator  functions  {/}(£);  t  =  1,  2, . . .}  are  given  by  a  sequence  of  i.i.d.  Bernoulli  rvs  with 
E  [/}(£)]  =  PER.  The  indicator  functions  {/•(/):/  =  1,2,...},  j  =  1,2,...,./,  for  different 
users  are  assumed  to  be  mutually  independent. 

Lemma  1:  Consider  a  single  queue  j ,  j  =  1,2 ,...,/.  A  necessary  condition  for  queue  j  to 
be  stable  is  Aj  <  D_j-  Moreover,  if  the  departure  process  {dj(t);  1  =  1.2....}  is  given  by  a  finite 
state,  ergodic  Markov  chain,  then  a  sufficient  condition  for  stability  is  Aj  <  Dj. 

Proof:  It  is  well  known  that  with  Markovian  arrival  and  departure  processes  a  sufficient 
condition  for  the  queues  to  be  stable  is  Aj  <  Dj  [10],  [11].  Hence,  here  only  prove  that  Aj  <  Dj 
is  a  necessary  condition. 
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Suppose  that  A?  >  D_r  Select  e  >  0  such  that  A,  —  D_j  —  2e  >  0.  We  can  find  a  subsequence 
{tj},  where  i,  — >■  oo,  such  that  for  all  i, 


EtU 

u 


>Aj-e 


JXi  dj(r) 

U 


<  Dj  +  e  . 


Then,  it  is  easy  to  see  that  the  queue  size  Xj(tj)  satisfies 


x 


■<{U)  =  J2  ai(T )  “  £  clM)  >  (A;  “  D,  -  2e)ti  for  all  . 

T=  1  T=1 


Define  a  :=  Ay  —  Dj  —  2e,  and  let  T,  denote  the  additional  time  it  takes  for  the  queue 
size  Xj(t)  to  drop  below  a  threshold  value  c,  starting  at  the  value  Xj(U)  in  timeslot  Clearly 
Tj  >  (atj.  —  c)  / v max,  where  vmax  =  max  V  is  the  largest  transmission  rate  available.  Thus,  at 
time  t.j  +  Tj  the  fraction  of  time  the  queue  size  exceeds  c  is  lower  bounded  by  Tj/(tj  —  T, ) , 
which  is  greater  than  or  equal  to  (at,  —  c) / (at j  —  c  +  vmaxtj ).  Therefore, 

1  f 

lim  sup  -  ^2  1  [xj (r)>c]  >  lim  (atj  -  c)/(atj-c  +  vmaxU) 

t-^OO  t  T—l  t— >oo 


aj {a  +  vmax)  . 


(9) 


Since  (9)  is  true  for  all  c  >  0, 

1  # 

Jjjfi  Imi  Slip  ,  l[.Cj(r)>c]  T  a/ (a  +  V maoc )  ^  0  , 

c^°°  t-Kx>  1  T=i 

and  the  system  is  not  stable.  ■ 

Using  a  stationary  scheduling  policy  that  utilizes  only  the  channel  state  information  leads  to 
a  departure  process  {clj(t)\t  =  1,2,...}  produced  by  a  Markov  chain  for  all  queues  j,  with  an 
average  rate  given  by  the  right  hand  side  of  the  condition  in  (2).  Therefore,  Lemma  1  ensures 
stability  when  the  arrival  vector  A  satisfies  the  condition  in  Proposition  1. 

We  now  proceed  to  prove  that  the  condition  in  (2)  is  a  necessary  condition  even  when  the 
above  restriction  on  the  scheduling  policy  is  removed.  Suppose  that  all  queues  can  be  stabilized 
with  some  scheduling  policy.  Then,  from  the  proof  of  Lemma  1  a  necessary  condition  for  stability 
is  that  Aj  <  D_j  for  all  users  j  G  {1,  2,  •  •  • , ./},  where  D_j  is  defined  in  (8). 
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1(SR,  t)  = 


Define  1(S,£)  to  be  indicator  function  of  the  event  that  channel  is  in  state  S  in  timeslot  t, 
i.e.,  1(S,£)  =  1  if  S (t.)  =  S  and  0  otherwise,  and 

1,  if  channel  is  in  state  S  and  a  rate  matrix  R  is  selected  in  timeslot  t 
0,  otherwise 

Fix  e  >  0.  There  exists  sufficiently  large  i  such  that 

E'  VnS:7S  <  'Ts  I  f  •  Uj  <  |E'  ,  d;i(r)  I  r  ,  and 
E <  x  _  PER  e  for  all  ■  =  1  2  . .  j  . 

E-  ,  TSR,t)  - 


Therefore,  for  each  j  (1  {1,2,...,./}  we  have 

=  Dj<  i  E  dAr)  +  e 

1  T=  1 


t 


T— 1 


=  E 

ses 


Et=,i(S.T) 


E„il(S.r)„ 


E  l(S,r)r,(R(r))(l  -  IJ(r))  +  e 


E  EMSR,r)r3-(R)(l-/;(r))  +  E 


<  l S(7rs  +  e)EU  l(S,r)  rEs  t=i 

=  E(^s  +  e)  E  E-ll(SR")?-(R)(l-^))+e 

Se5  Res  EUl(S,r) 

V-/  ,  E*=il(SR,r)EUl(SR,r)ri(R)(l-/?(r))  , 

=  E(^s  +  e)  e  i  , - , - +  e  ’ 

ses  Res  ,Lr=i  Mo,  rj 


e;=1i(sr,t) 


where  r,(R)  is  the  j-th  element  of  the  vector  RT1/Xi. 
Define 

Er=l  1(SR,  T) 


CsR{t)  — 


EUl(S-r) 


Then, 


V El's  +«)  E  CsR(i)r,(R) - — | — — — — - h  t 

ses  Res  Er= iRoK,  rj 

<  E^s  +  e)  EC'sR(t>j(R)(l-P^R  +  e)+e 

ses  Res 

<  E  E  ^sr (£)?j(R)(l  —  PER)  +  e  +  evmax  +  e\S\vmax(l  —  PER  +  e) 

s  es  Re  s 

=  E  ^  E  ^sr (£)r,(R)(l  —  PER )  +  e(l  +  vmax  +  \S\vmax(l  —  PERT  e)) 

ses  Res 
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Since  e  can  be  arbitrarily  close  to  zero,  this  completes  the  proof. 

Appendix  II 

Proof  of  Proposition  2 

We  follow  the  same  fluid  limit  approach  first  shown  in  [9]  and  used  in  [1],  Let 

Y  =  (Y (t),t  >  0)  :=  ({((^(f), . . . ,  r;X  /  (/)).,/  G  jy  S (<)},  t  >  0)  (10) 

denote  the  stochastic  process  describing  the  behavior  of  the  system,  where  if)  is  the  waiting 
time  of  the  A>th  packet  in  queue  j  and  Xj(t)  is  the  number  of  backlogged  packets  in  queue  j. 
We  define  the  norm  of  the  state  Y(f)  to  be  ||Y(i)||  :=  +  W j{t)),  where  W j(t)  is 

the  waiting  time  of  user  j’s  head-of-line  packet.  Let  {Y^n\n  =  1,2,...}  denote  a  sequence  of 
processes,  where  Y(n)  has  an  initial  condition  that  satisfies  Y'"j(0)  =  n.  Theorem  4  in  [1, 

p.  200]  tells  us  that  the  random  process  Y  in  (10)  is  stable  if  there  exists  e  >  0  and  an  integer 
T  >  0  such  that,  for  any  sequence  of  processes  {Y^n\  n  =  1,  2, . . .},  we  have 

lira  sup  E  [-  Y {n\nT)  1  <  1  -  e  .  (11) 

n— s-oo  L  Tl 

Using  this  result,  we  prove  that  a  family  of  scheduling  policies  that  select  a  rate  matrix 
argmaxReS(q  X(f)T E(HTlIxl)  in  timeslot  t,  where  3  =  diag (£,,  j  G  J),  is  throughput  optimal 
for  any  positive  constants  j  (1  J ,  and  the  system  is  stable  for  any  arrival  rate  vector  A  G 
int(A).  Note  that  our  proposed  scheme  is  a  special  case  with  =  1  for  all  j  G  J . 

We  first  introduce  the  following  processes: 

.  Fj(t)  -  the  total  number  of  user  j’s  packets  that  have  arrived  up  to  time  t,  including  those 
in  the  queue  at  time  t  =  0. 

•  Dj(t )  -  the  total  number  of  user  j’s  packets  that  have  been  successfully  transmitted  up  to 
time  t. 

•  Gs{t)  -  the  number  of  timeslots  before  time  t  when  the  channel  state  was  S. 

•  (jsr(A)  ~  the  number  of  timeslots  before  time  t  when  the  channel  state  was  S  and  the  rate 


matrix  R  G  S  was  selected. 
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.  UJ(i)=mf{S<i|FJ(S)>DJ(<)} 

For  process  Y(">  we  assume  that  packet  arrivals  begin  at  time  t  =  —  n  although  the  packet 
service  begins  at  time  t  =  0.  For  notational  simplicity  we  use 

Z  :=  {((//•  /  )j  ■  X ; .  U ; .  W; ) . ./  G  J),((Gs,Sg  5),(Gsr,S  G  5;Rg  S))}  . 

Here  the  processes  F-n\  and  U^'*  are  defined  on  [— n,  oo)  [l].3  Note  that  a  sample  path 
of  random  process  Z  uniquely  defines  that  of  random  process  Y. 

We  denote  by  z!/n)  (t)  the  scaled  process  ^Z(n}(nf  ).  The  following  lemma  tells  us  that,  for 
any  sequence  {Z^n\n  =  1,2,...},  there  exists  a  subsequence  {Z^k\k  G  /C  C  IN},  where 
N  =  {1,2,...},  such  that  the  scaled  process  converges  to  a  fluid  process.  Due  to  the  space 
constraint,  we  omit  the  proof  of  Lemmas  2  and  3. 

Lemma  2:  With  probability  one,  for  any  sequence  of  processes  {Z  ^ ,  n  =  1,2,.. .},  there 
exists  a  subsequence  {Z^k\  k  G  /C  C  IN}  such  that,  as  k  — >■  oo,  the  following  convergence  holds 
for  the  scaled  subsequence  {z^k\  k  G  1C}: 

( fjk]{t),t>  -1)  =>  ( fj{t),t>  -1)  ,  0)  (fj{t),t  >  0) 

>  0)  -^u.o.c.  (dj(t),t  >  0)  ,  (x^)(t),t  >  0)  -^u.o.c.  (Xj(f),f  >  0) 

{9s\t)^>  0)  ->■ u.o.c.  (gs{t),t  >  0)  ,  {gsi{t),t  >  0)  -^u.o.c.  {gsR(t),t>  0) 

(u >  0)  =>•  (u j(f),f  >  0)  ,  (w f\t),t  >  0)  =>•  (w j(t),t  >  0) 

where  =>•  denotes  convergence  at  every  continuity  point  of  the  corresponding  limit  function,  and 
u.o.c.  denotes  uniform  convergence  on  compact  intervals. 

The  limit  functions  fj,dj,x.j,gs,gsR,'Uj,  and  w?  satisfy  the  following  properties:  (i)  fj  are 
right  continuous  with  left  limits  (RCLL),  non-negative,  and  non-decreasing  in  [-1,  oo),  (ii) 
fj .  dj .  gs ,  9 SR.  are  non-negative,  non-decreasing  and  Lipschitz  continuous  in  [0,  oo),  (iii)  x  ; 

’Here  we  extend  the  discrete  time  processes  to  continuous  time  processes  by  assuming  that  the  values  of  the  processes  are 


constant  over  [f,  t  +  1). 
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are  continuous  in  [0,  oo),  (iv)  Uj  are  non-decreasing  and  RCLL  in  [0,  oo),  and  (v)  Wj  are 
non-negative  and  RCLL  in  [0,  oo). 

In  addition,  the  limit  function  z  =  (/,  d ,  g,  g,  x,  u,  w)  satisfies  the  following  properties  for  all 
S  G  <S  and  R  G  S: 

E  /i(°)  <  1  «  hi*)  ~  /j(°)  =Ajt,  t  >  0  ,  xj(t)  =  fj{t)  -  dj(t)  ,  t  >  0. 

jej 

For  any  interval  [ti,t2\  C  [0,  oo), 

dj(^)  -  <E  E  rj{R){gSR{t2)  -  gsnih))  (12) 

Se5  Res 

with  equality  holding  if  x.j{t)  >  0  for  all  t  G  [/  [ .  U].  Further,  if  dj(t\)  >  /)■(())  for  some 
t\  G  [0,  oo),  then  for  all  t  G  [£i,  oo), 

AjWj{t)  =  Xj(t)  .  (13) 

We  define  regular  points  to  be  the  points  t  G  [0,oo)  at  which  the  derivatives  of  the  limit 
functions  fj  ( • ) ,  dj  ( • ) ,  ( ■ ) ,  ^sr ( • )  and  Xj  ( • )  exist. 

Lemma  3:  With  probability  one,  limit  function  z  satisfies  the  following  property:  Suppose 
that 


E  i,x,(/jr;!R!  <  max  E  l/yfR/) 
jej  Resjej 

for  some  S  and  R  G  S  at  some  regular  point  t  >  0.  Then,  g'SR{t)  =  0. 
We  now  introduce  the  following  quadratic  Lyapunov  function. 


L(x)  =  9  E  0X 

-  jej 


for  a  vector  x  =  (xi, . . . ,  xj). 


Lemma  4:  The  limit  function  z  satisfies  the  following:  (i)  L(x(t)),f  >  0,  is  an  absolutely 
continuous  function,  and  (ii)  for  any  (5!  >  0,  there  exists  S2  >  0  such  that,  with  probability  one, 
at  any  regular  point  t,  L(x(t))  >  d'i  implies  AL(x(t))  <  —S2. 


Proof:  For  notational  simplicity  we  assume  PER  =  0,  i.e.,  all  packet  transmissions  are 
successful,  in  this  proof.  Let  7 Z  :=  UsgsS  be  the  set  of  all  possible  rate  matrices  from  which 
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the  scheduler  can  choose,  r  :=  \R\,  and  s  :=  |<5|  <  oo.  Let  $  denote  an  s  x  r  stochastic  matrix 
such  that  <f>SR  is  the  probability  that  rate  matrix  R  C  'R  is  selected  when  the  channel  state  is  S 
and  Y,ReTi  $sr  =  1-  We  assume  that  <f>SR  =  0  if  R  ^  S.  Given  a  stochastic  matrix  <f>,  let 


«j($)  =  EffsE  $srg(r)  • 

ses  Res 

At  any  regular  point  t, 

±LW))  =  E  ~  dj(t)) 

01  jej 

=  E  s/x/!/)!  b  -  %($))  +  E  s>'a/)0(<I')  -  E  s/x,':/)'';!<I)!/)! 

jej  jej  jej 

where  $  is  a  stochastic  matrix  corresponding  to  a  stationary  scheduling  policy  that  satisfies 


—  A  >  e  ■  1  Jxl  for  some  s  >  0,  $Sr (f)  =  S'srI^As.  and  the  last  equality  follows  from 


=  E  E  G(R)#sr(*)  from  (12)- 

ses  Res 

The  existence  of  such  3>  is  guaranteed  since  A  G  int(A). 

Let  (53  >  0  be  a  constant  such  that  L(x)  >  A |  implies  maxjej  x?  >  S3.  Then, 

E  tjXjWAj  ~  r./'<1,!i  <  -(min^j)  •  S3  •  min(^(<§)  -  Aj) 

jej  jeJ 

<  -  (min  A)  •  S3  ■  s  :=  -S2  ■ 
jej 

Now  in  order  to  complete  the  proof  it  suffices  to  show 

E  s,x,!/)':/!<I)!/!)  >  E  ZjXj(t>t{®)  ■ 

jej  jej 

Define 


/\  ( ‘L. x i  :=  E  £/x/r/(<1’) 

jej 

=  E  s,x,  E  ~s  E  $sr g(r) 

jej  s  Res 

=  E'»E*srEW;W-  (14) 

s  Res  jej 

From  (14)  it  is  easy  to  see  that,  for  any  non-negative  vector  x_,  a  stochastic  matrix  <f>  maximizes 
K($>,x)  if  and  only  if  TSr  =  0  if  }',  j  ZjZji'jCR.)  <  maxR/eS  Y.jej  £j£-jrj{R')-  This  property 
is  satisfied  for  x  =  x(f)  and  <f>  =  $(f)  from  Lemma  3,  and  the  lemma  follows.  ■ 
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An  immediate  consequence  of  Lemma  4  is  that,  for  any  S  >  0,  there  exists  T  >  0  such  that 
with  probability  one,  L(x(t.))  <  5,  t  >  T. 

We  now  proceed  with  the  proof  of  the  proposition.  Lemmas  2-4  state  that,  for  any  ex  >  0 
we  can  find  a  large  enough  integer  T  >  0  such  that,  for  any  sequence  of  processes  {Z  ^ ,  n  = 
1,2,...},  there  exists  a  subsequence  {Z^k\k  C  /C  C  IN}  such  that,  with  probability  one,  the 
convergence  to  the  limit  function  takes  place  (Lemma  2)  and  Y.jejxj{T)  <  G  (Lemma  4).  If  X 
is  sufficiently  large,  it  follows  dj(T)  =  fj{T)  —  xy (7j  >  fj{T)  —  g  >  /}(())  for  all  j  G  J .  This 
implies  by  (13)  that  w j(T)  =  xy (T) /.ly  for  all  j  G  J”.  Since  g  is  small,  this  in  turn  implies 
that 


E  xj(T)  +  E  wl(T)  <  (!  +  1/ (min  Aj))ei 
G  j-  J  3 


:=  1  -  e  <  1 


Thus,  with  probability  one 


limsup—  ||Y^(nT) 


n— >oo  TL 


<  l~e  , 


(15) 


and  the  proposition  follows  from  (11). 


